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ABSTRACT 

Empirical  Orthogonal  Function  (EOF)  and  cross -correlation 
analysis  were  used  to  examine  the  interrelationships  between 
anomalous  wind  forcing  and  anomalous  sea  surface  temperature 

(SST)  over  the  North  Pacific  Ocean  for  the  period  January  1969 

3 
through  December  1978.   Wind  forcing  was  represented  by  uA  , 

friction  velocity  cubed,  and  curl  x,  wind  stress  curl,  as 
computed  from:   (a)  high-pass  filtered  (periods  less  than 
ten  days)  wind  components  only,  (b)  a  combination  of  highl- 
and low-pass  (periods  greater  than  ten  days)  filtered  wind 
components,  and  (c)  the  unfiltered  (total)  wind  components. 
EOF  analysis  was  applied  to  the  monthly  anomalies  to  extract 
"signal"  from  "noise."   Lag  correlation  analysis  was  used  to 
investigate  the  relations  between  anomalous  wind  values  and 
SST. 

It  was  found  that  both  u*"5  and  curl  t  correlation  maps 
show  significant  increases  in  correlations  at  zero  and  plus- 
one-month  lag.   (Large  areas  of  the  North  Pacific  Ocean  were 
correlated  at  the  32%  significance  level  and  several  smaller 
areas  at  10%.)   These  relationships  were  somewhat  stronger 

3  T 

for  u*   than  for  curl  l  and  stronger  for  the  storm-related 
(filtered)  parameters. 
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I.   INTRODUCTION 

This  study  is  a  statistical  investigation  using 
Empirical  Orthogonal  Function  (EOF)  analysis  and  cross 
correlation  analysis  to  examine  the  interrelationships 
between  anomalous  wind  forcing  and  anomalous  sea  surface 
temperatures  (SST)  over  the  North  Pacific  Ocean  from  20°N 
to  60°N  and  110°W  to  130°E  for  the  period  January  1969 
through  December  1978.   Particular  attention  is  given  to 
investigating  the  effect  of  surface  wind  components  having 
time  scales  of  ten  days  or  less.   The  primary  hypothesis 
to  be  tested  in  this  research  is  that  the  sea  surface 
temperature  responds  to  atmospheric  wind  forcing,  as  mani- 
fested in  storms,  and  that  the  amount  of  the  response  is 
proportional  to  the  amount  of  wind  forcing.   No  other  surface 
temperature  changing  processes  are  included  in  this  work. 
The  concept  for  examining  storminess  comes  from  the  strong 
relationship  that  exists  between  synoptic  storms  and  air-sea 
exchange  [Simpson  (1969),  Fissel,  Pond  and  Miyake  (1977), 
Elsberry  and  Camp  (1978)].   In  addition,  Davis  (1976),  who 
examined  the  statistical  relationship  between  monthly  sea- 
level  pressure  anomalies  and  SST  anomalies,  suggested  that 
anomalous  atmospheric  storminess  might  have  a  strong  corre- 
lation to  anomalous  SST. 
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The  approach  and  expectations  of  this  study  were  that: 
(1)  EOF  analysis  would  be  used  to  extract  the  essential 
features  from  the  complex  space-time  patterns  of  wind  and 
SST  data,  i.e.,  it  would  extract  the  "signal"  from  "noise" 
in  the  data,  (2)  correlation  analysis  would  be  used  to 
examine  relationships  between  the  variables  once  the  noise 
was  removed,  and  (3)  anomalous  storminess  values,  computed 
in  the  form  of  friction  velocity  cubed  and  wind  stress  curl, 
would  be  significantly  correlated  with  anomalous  SST  cooling. 

The  atmospheric  data  for  this  study  were  a  special 
series  of  six-hourly  surface  wind  analyses  prepared  by 
Fleet  Numerical  Oceanographic  Center  (FNOC) .  Friction 
velocity  cubed  (u*  )  and  the  vertical  component  of  the  curl 

of  the  wind  stress  (curl  x)  were  calculated  because  of  the 

^     z 

3 
established  relationship  between  uA   and  the  mechanical 

mixing  of  the  sea  by  the  overlying  atmosphere  and  the  well- 
established  relationship  between  curl  t  and  Ekman  convergence 

3 
and  divergence.   Monthly  mean  anomalies  of  u*   and  curl  t 

were  computed  using:   (a)  high-pass  filtered  (periods  less 
than  ten  days)  wind  components  only,  (b)  a  combination  of 
high-  and  low-pass  (periods  greater  than  10  days)  filtered 
wind  components,  and  (c)  the  unfiltered  (total)  wind  com- 
ponents.  The  partitioning  of  wind  parameters  into  these 
three  parts  was  designed  to  identify  the  effect  of  storms, 
both  in  isolation  and  in  the  presence  of  the  existing  mean 
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(low-pass)  flow.   Thus,  six  fields  were  created  to  represent 
the  anomalous  wind  influence  upon  the  ocean  surface.   EOF 
analysis  of  all  data  fields,  including  the  anomalous  SST 
values,  was  employed  to  extract  the  significant  and  almost 
noise-free  "geophysical  patterns."   The  resulting  anomalous 
wind  parameters  were  then  correlated  with  the  anomalous  SST 
data  at  the  time  lags  of  minus  three  through  plus  three 
months  to  investigate  possible  temporal  relationships. 

The  first  topic  considered  below  is  data  set  description 
and  analysis.   It  is  followed  by  a  discussion  of  EOF  analy- 
sis and  an  explanation  of  the  truncation  scheme  used  to  reduce 

noise.   Then  some  observations  are  given  on  the  lagged  corre- 

3 
lations  of  uA   and  SST,  and  curl  x  and  SST.   The  statistical 

significance  of  the  correlations  is  carefully  examined  and 

interpretations  are  made  in  light  of  that  significance. 

Finally,  some  conclusions  are  drawn  about  the  results 

produced. 
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II.   DATA  SET  DESCRIPTION  AND  ANALYSIS 

A.   ACQUISITION  AND  ANOMALY  COMPUTATION 
1 .   Sea  Surface  Temperature 

Monthly  sea  surface  temperature  (SST)  data  for  the 
period  January  1969  through  December  1978  were  obtained  from 
the  Scripps  Institution  of  Oceanography  Climate  Research 
Group  through  the  generosity  of  Jerome  Namias  and  Robert 
Born.   The  geographical  orientation  of  these  data  as  delin- 
eated in  Figure  1  was  a  nine  by  twenty- five  array  utilizing 
the  5°  square  grid  used  by  Namias'  group  which  extends  from 
20°N  to  60°N  and  110°W  to  130°E.   Due  to  missing  data  during 
the  120-month  record  at  some  of  the  225  grid  points,  only 
166  grid  points  were  retained  for  analysis.   The  small 
squares  and  circles  in  Figure  1  show  these  166  points.   At 
these  points,  only  79  discrete  values  were  missing  out  of  the 
total  19,920  possible.   These  missing  values  were  filled  by 
temporally  interpolating  between  the  preceding  and  following 
monthly  values.   All  SST  values  were  converted  from  Fahrenheit 
to  Centigrade  and  then  processed  to  compute  anomalies.   The 
computation  of  monthly  anomalies  from  all  data  sets  in  this 
study  was  accomplished  by  finding  the  difference  between  the 
observed  monthly  value  and  the  10-year  average  value  found 
at  the  same  grid  point  and  month  of  year  as  the  observation. 
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2.   Friction  Velocity  and  Surface  Stress 

In  addition  to  SST,  six  anomalous  surface  wind 
parameters  were  investigated.   Three  involved  u.   where  uA 
is  the  friction  velocity  and  three  involved  curl  x  where  t 

2 

is  the  surface  stress.   These  were  obtained  from  Haney, 
Risch  and  Heise  (1980)  ,  which  in  turn  was  based  in  part  on 
the  earlier  thesis  by  Risch  (1980).   They  used  a  special 
series  of  six-hourly  surface  wind  analyses  over  the  North 
Pacific  Ocean  prepared  by  Fleet  Numerical  Oceanographic 
Center  (FNOC)  covering  the  period  January  1969  through 
December  1978  to  analyze  for  synoptic  storm  activity. 
Temporal  variance  of  the  surface  wind  components  at  each 
grid  point  was  "high-pass"  filtered,  thereby  extracting  all 

temporal  variability  having  time  scales  less  than  ten  days. 

3 
The  three  uA   and  three  curl?x  data  sets  were  then  calcu- 
lated using,  respectively,  (a)  the  total  (unfiltered)  wind 
components;  (b)  the  high-pass  wind  components  only;  and 
(c)  a  combination  of  high-  and  low-pass  wind  components 
which  represent  the  effects  of  storms  (high-pass)  in  the 
presence  of  the  existing  mean  (low-pass)  field.   See  Haney, 
Risch  and  Heise  (1980)  for  details.   Having  calculated  the 
above  wind  parameters  at  every  six-hourly  observation  over 
the  entire  10-year  period,  monthly  means,  monthly  climatolo 
gies  over  10  years,  and  finally,  monthly  anomalies  were 
constructed.   The  monthly  anomalies,  covering  the  same 
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10-year  period  as  the  SST  anomalies,  for  the  six  wind- 
related  data  sets  were  then  interpolated  from  the  FNOC 
Northern  Hemisphere  Polar  Stereographic  grid  to  the  same 
latitude-longitude  grid  points  as  the  SST  anomaly  data. 

B.   DETRENDING  AND  STANDARDIZATION 

The  seven  data  sets  representing  monthly  anomalies  of 

3  3 

uA  Fm,  curl^xFm  [both  from  total  wind  components),  u^  Fl, 

curl  xFl  (both  from  high-pass  wind  components  only),  uA  F2, 

curl  tF2  (both  from  high-  and  low-pass  wind  components) 

and  SST  were  then  detrended  and  standardized.   The  detrend- 

ing  equation  was  of  the  form 


Y   =  y   -  (at   +  b) 
m   J  m    v   m 


where  Y   is  the  detrended  value  at  the  m    time  observation, 
m 

y   is  the  original  observation,  a  is  the  slope,  b  is  the 

^m  ° 

intercept  and  t   =  (m  -  l)At,  with  At  =  one  month,  being 
the  time  interval.   This  detrending  was  performed  to  remove 
all  linear  trends  in  the  anomalies. 

Standardization  of  the  seven  data  sets  was  completed 
by  using  an  equation  of  the  form 


X  =  !S_^JL 
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where  X  is  the  standardized  value,  x  is  the  detrended  value 
of  the  anomaly  at  a  particular  grid  point,  x  is  the  time 
mean  value  of  x,  and  s  is  the  standard  deviation  of  x  -  x. 
The  use  of  standardized  variables  assures  that  each  variable 
at  each  grid  point  is  of  equal  importance  in  determining 
the  form  of  the  field  for  comparison  to  other  variable 
fields.   Additionally,  standardized  anomaly  fields  of  sea 
surface  temperature  and  wind  parameters  correspond  more 
closely  to  the  climatological  classifications  of  normal, 
above  normal,  much  above  normal,  etc.,  than  do  the  anomaly 
fields  themselves  [Kutzbach,  1967].   Standardization  is 
required  in  order  to  utilize  the  EOF  method  of  extracting 
"signal"  from  "noise,"  and  it  does  not  affect  the  subsequent 
correlation  analysis. 

The  standard  deviation  maps  of  all  seven  data  fields  are 
included  in  Figures  2  through  4.   Figure  2  shows  that  the 
centers  of  activity  for  the  three  uA   data  sets  occurs 
eastward  of  Japan  at  about  45°N  and  165°E.   The  largest 
changes  of  the  curl  fields  in  Figure  3  appear  to  be  located 
about  50°N  and  170°E.   Finally,  the  standard  deviation  pattern 
of  the  SST  data  is  indicated  in  Figure  4  with  the  most  active 
area  due  east  of  Japan,  45°N  and  150°E. 
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Figure  2.   Contours  of  the  standard  deviation  fields 

3 
of  the  uA   data. 
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Figure  3.   Same  as  Fig.  2  except  for  curl  data. 
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Figure  4.   Same  as  Fig.  2  except  for  SST  data. 
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III.   CALCULATIONS  PERFORMED 

A.   EMPIRICAL  ORTHOGONAL  FUNCTION  ANALYSIS 

This  research  used  empirical  orthogonal  function  analysis 
to  reduce  a  large  number  of  highly  correlated  variables  to 
a  small  number  of  independent  variables.   These  are  linear 
combinations  of  the  original  ones  and  contain  a  large  per- 
centage of  the  total  variability.   This  technique  has  been 
used  extensively  in  meteorology  and  oceanography  [see,  e.g., 
Kutzbach,  1967;  Stidd,  1967;  Davis,  1976;  Weare,  1977; 
Harr,  1979]  and,  unlike  most  other  orthogonal  representations, 
this  technique  does  not  require  a  predetermined  function, 
such  as  the  Fourier  components,  sine  and  cosine,  but  depends 
upon  the  spatial  and  temporal  interrelationships  within  the 
data  being  analyzed. 

Kutzbach  (1967)  provides  a  lucid  outline  of  the  mathe- 
matical procedure  necessary  to  define  the  orthogonal 
functions  and  their  coefficients.   A  covariance  (or  correla- 
tion) matrix  must  be  formed  from  the  data  set.   To  maximize 
the  variance  explained  by  the  new  functions,  one  must  solve 
the  system  of  linear  equations  associated  with  this  co- 
variance  matrix  for  a  set  of  eigenvalues.   The  eigenvalue 
represents  the  fraction  of  the  total  variance  in  the  original 
data  field  explained  by  its  corresponding  eigenfunction .   A 
function  is  found  by  substituting  a  given  eigenvalue  into 
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the  system  of  equations  and  solving  with  the  aid  of  an 
additional  normalization  condition.   Just  as  the  sines  and 
cosines  of  Fourier  analysis  are  space -orthogonal ,  the  eigen- 
functions  produced  by  EOF  analysis  are  also  space-orthogonal 
Associated  with  each  function  is  a  (time)  coefficient  which, 
when  combined  with  the  function  and  the  result  summed  over 
all  functions,  provides  an  exact  reproduction  of  the 
original  field.   That  is, 


120 
X(x,t)  =   Z   A  (t)EOF  (x) , 
m=l 


where  EOF^  is  the  eigenfunction ,  and  A   is  the  time  coefficient 

m  °  '      m 

related  to  the  m   eigenvalue.   These  coefficients,  which 
are  time -orthogonal  [Kutzbach,  1967],  are  obtained  from  the 
functions  and  the  original  anomaly  field.   The  specific 
formulations  of  the  mathematics  described  here  are  detailed 
in  the  Appendix. 

Figure  5  shows  the  cumulative  percentage  of  variance 
explained  versus  eigenvalue  mode  number  for  the  seven  sets 
of  data  examined  in  this  study.   For  all  the  seven  data 
fields,  the  cumulative  percent  variance  tends  to  increase 
logarithmically  with  the  mode  number.   Mode  number  one  is 
that  spatial  pattern  which,  in  a  least  squares  sense, 
explains  the  most  variance  possible  (averaged  over  all  grid 
points  and  time).   That  variance  is  between  9  and  191  of  the 
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Figure  5.   Cumulative  percentage  of  total  variance  versus 
eigenvalue  mode. 
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total  variance  depending  on  which  of  the  seven  fields  is  in 
question.   It  is  significant  that  the  cumulative  variance 
explained  by  the  SST  and  u*   modes  rises  more  rapidly  than 
that  of  the  curl^x  modes.   This  means  that  the  major  part  of 
the  SST  and  u*   fields  can  be  explained  by  a  few  spatial 
modes  while  the  curl  variance  is  distributed  more  equally 
among  a  greater  number  of  modes.   The  next  section  describes 
the  method  of  determining,  for  each  data  variable,  how  many 
modes  to  retain  for  analysis. 

B.   TRUNCATION  METHOD  AND  SIGNIFICANCE  LEVEL 

Various  investigators  [Beale  e_t  al.  ,  1967;  Craddock  and 
Flood,  1969;  Farmer,  1971;  Dyer,  1975;  Rinne  and  Jarvenoja, 
1979]  have  suggested  numerous  methods  for  determining  the 
point  of  truncation,  i.e.,  finding  the  sufficient  number  of 
components  of  the  series,  given  by  the  empirical  orthogonal 
functions,  to  be  retained  for  analysis.   The  choice  of  method 
depends  largely  on  the  purpose  for  which  the  empirical 
orthogonal  function  expansion  is  to  be  used.   In  this  study, 
empirical  orthogonal  function  analysis  is  used  to  reduce 
"noise."   The  term  noise  is  defined  here  as  that  variability 
or  randomness  confined  to  small  space  scales,  purely  local 
effects,  and  observational  errors.   Recent  numerical  experi- 
ments by  Preisendorfer  and  Barnett  (1977)  indicate  that  the 
classical  asymptotic  theory  of  random  eigenvalues  is  inade- 
quate to  provide  significance  tests  when  the  number  P  of 
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spatial  data  points  is  in  the  range  of  10  to  100  (normally 
encountered  in  geophysical  practice)  and  when  the  historical 
sample  size  N  is  in  the  similar  range  of  10  to  100.   In  the 
present  study  P  =  166  and  N  =  120.   Their  experiments  revealed 
that  the  asymptotic  theory  takes  hold  only  for  N  on  the  order 
of  1000  or  more  for  the  above  P  values.   After  numerous 
experiments,  Preisendorf er  and  Barnett  have  developed  an 
empirical  procedure  for  empirical  orthogonal  function  trunca- 
tion in  the  case  of  small  sample  sizes  based  on  analysis  of 
various  arrays  filled  by  Gaussian  random  variables  of  unit 
variance,  zero  mean.   The  procedure  provides,  for  a  given  P 
and  N,  the  significant  confidence  band  for  the  randomly 
generated  eigenvalues,  thus  providing  the  objective  method 
of  identifying  the  "noise"  in  a  given  data  set.   Tim  Barnett 
generously  computed  and  provided,  for  the  specific  N  and  P 
values  in  this  study,  the  5  and  95%  confidence  limits  of 
eigenvalues  for  modes  one  to  twenty- five .   Figure  6  demon- 
strates the  use  of  this  truncation  process  for  SST  eigen- 
values.  The  figure  shows  eigenvalues  plotted  against  mode 
number  on  a  semi- logarithmic  grid.   The  first  eigenvalue  is 
always  the  largest,  since  the  first  EOF  explains  the  most 
variance.   The  two  solid  curved  lines  labeled  95%  and  5% 
represent  the  indicated  range  of  the  eigenvalues  obtained 
by  Barnett  from  one  hundred  repeated  calculations  using  166 
by  120  arrays  of  random  variables  of  unit  variance.   Thus, 


25 


Figure  6.   Eigenvalues  of  SST  versus  mode  number.   The  95% 
band  indicates  the  truncation  point  (from 
Preisendorfer  and  Barnett). 
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out  of  the  100  repeated  experiments,  95%  of  the  eigenvalues 
fell  within  the  rather  narrow  band  indicated  by  the  two 
curves,  with  5%  above  the  top  curve  and  5%  below  the  bottom 
curve.   The  e's  represent  the  eigenvalues  of  the  SST  anomaly 
field,  also  standardized  and,  therefore,  having  unit  variance. 
Notice  that  the  arrays  of  random  numbers  have  the  variance 
spread  rather  evenly  throughout  the  various  modes  while  the 
SST  field  has  much  of  its  variance  in  the  first  ten  or  twenty 
out  of  the  total  of  120  modes.   According  to  Preisendorfer 
and  Barnett  (1977)  ,  the  method  of  extracting  the  "geophysical 
signal"  from  the  SST  data  is  to  keep  those  empirical  orthogonal 
function  modes  with  eigenvalues  lying  above  the  95%  band 
created  by  the  random  data  fields.   In  this  case,  only  5% 
of  the  retained  field  is  likely  to  be  attributable  to  random 
noise.   In  the  case  of  the  SST  anomaly  data,  the  first  11 
modes  were  retained.   This  truncation  procedure  was  applied 
to  each  data  field  with  the  results  shown  in  Table  I .   Table  I 
shows,  for  each  data  field,  the  number  of  empirical  orthogonal 
function  modes  which  were  retained  and  the  cumulative  percent 
of  the  total  variance  which  those  modes  explain. 

Note  that  the  data  used  in  this  study  have  a  rather  large 

3 
amount  of  noise.   The  SST  and  u*   anomaly  fields  are  about 

75%  signal  and  25%  "noise,"  while  the  curl  x  anomaly  fields 

are  only  62  to  65%  "signal."   There  is  more  "noise"  in  the 

curl  field  because  it  is  a  spatial  derivative  of  a  quantity 
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(stress)  which  contains  small-scale  errors  and  large-scale 
geophysical  variation. 

Figure  7  is  a  comparison  between  the  SST  values  at  an 
individual  grid  point  before  and  after  EOF  truncation. 
Curve  (A)  represents  the  detrended  and  standardized  SST 
values  from  grid  point  number  84,  located  one-third  of  the 
way  between  Hawaii  and  the  Aleutian  Islands.   Curve  (B) 
represents  the  resultant  values  derived  from  the  truncated 
EOF  series  for  the  same  grid  point.   It  should  be  noted  that 
noise  is  reflected  in  (A)  by  the  curve's  rapid  small-scale 
fluctuations.   It  is  readily  apparent  that  curve  (B)  is 
a  damped  form  of  curve  (A).   A  considerable  amount  of  the 
small-scale  fluctuations  have  been  eliminated  with  curve  (B) 
still  retaining  the  large-scale  trends  of  curve  (A) . 

In  addition  to  the  method  described  above,  various  other 
ways  have  been  suggested  for  determining  the  point  of  trunca- 
tion of  the  series  given  in  EOFs .   Rinne  and  Jarvenoja  (1979) 
have  offered  some  of  their  own  methods  and  summarized  many 
others.   These  methods  included:   retaining  eigenvalues  which 
explain  more  than  one  percent  of  the  total  variance, 
retaining  eigenvalues  that  are  much  larger  than  others, 
applying  a  statistical  criterion  based  on  a  Bartlett's  test 
to  determine  the  values  to  be  saved,  and,  finally,  keeping  all 
the  largest  eigenvalues  which  together  account  for  99%  of  the 
total  variance.   Another  truncation  test  mentioned  in  this 
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summarization  is  one  which  uses  a  diagram  drafted  similar  to 
Figure  6  in  this  paper.   The  procedure  truncates  the  EOF 
series  at  the  largest  eigenvalue  at  which  the  eigenvalue 
(versus  mode  number)  curve  can  be  approximated  with  a  straight 
line.   In  Figure  6  the  straight  line  approximation  would 
start  best  at  mode  number  twenty-four  and  thus,  according  to 
this  method,  the  largest  twenty-three  eigenvalues  would  be 
retained. 

All  of  the  above  methods  would  have  retained  more  modes 
than  the  Preisendorf er  and  Barnett  scheme  which  was  used  in 
this  study.   The  choice  of  this  scheme  can  best  be  explained 
by  using  a  schematic  of  Rinne  and  Jarvenoja  (1979)  shown  in 
Figure  8. 

The  whole  box  represents  the  total  variance  and  this 
consists  of  real  environmental  information  (areas  1  and  2) 
and  noise  (areas  3  and  4) ,  separated  by  the  interior  vertical 
line.   EOF  series  truncation  controls  the  placement  of  the 
sloping  broken  line  which  divides  the  box  into  an  explained 
(areas  1  and  3)  and  unexplained  (areas  2  and  4)  portion.   The 
numbered  areas  are  now  identifiable  as:   (1)  real  information 
explained  by  EOFs ,  (2)  real  information  EOFs  fail  to  explain, 
(3)  noise  explained  by  EOFs,  and  (4)  noise  that  EOFs  fail  to 
explain.   (A)  represents  the  situation  when  a  relatively 
small  number  of  EOFs  are  retained  while  (B)  occurs  when  a 
relatively  large  number  is  saved.   The  Preisendorfer  and 
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B 

Figure  8.   Box  interpretation  of  the  total  variance  where 
A  is  the  case  of  saving  a  small  number  of  terms 
and  B  is  the  case  of  retaining  a  large  number 
of  terms  in  the  EOF  series  expansion. 
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Barnett  (1977)  method  used  in  this  study  is  closely  represented 
by  (A) .   Very  little  noise  is  retained  at  the  "expense"  of 
explaining  a  somewhat  reduced  amount  of  total  variance.   In 
the  following  section,  possible  relationships  between  the 
seven  truncated  anomaly  data  sets  are  examined. 
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IV.   RESULTS 

A.   LAG  CORRELATIONS  BETWEEN  u*3  AND  SST 

Figures  9  to  15  are  maps  of  lagged  correlations  between 
u*   and  SST  anomalies  (detrended,  standardized,  and  truncated) 
over  the  North  Pacific  Ocean.   The  map  series  steps  through 
lags  of  minus  three  to  plus  three  months,  in  one -month 
increments,  with  Figure  9  showing  the  minus -three -month  case 
and  Figure  15  ending  the  series  with  a  plus -three -month  lag 

between  uA   and  SST.   In  the  map  series,  SST  is  lagged 

3 
relative  to  u*  .   Thus,  the  map  with  a  lag  of  minus  three 

months  shows  the  correlation  between  the  uA   field  and  the 

SST  field  that  occurred  three  months  before  the  u*  .   In  the 

opposite  situation,  the  map  lagged  plus  three  months  shows  the 

correlation  between  the  u*   field  and  the  SST  field  that 

occurred  three  months  later.   Each  figure  contains  three 

maps,  which  are,  from  top  to  bottom,  correlations  between 

SST  and  (1)  u*  Fl  computed  from  high-pass  wind  components 

only,  (2)  uA  F2  computed  from  a  combination  of  high-  and 

low-pass  wind  components,  and  (3)  uA  Fm  computed  from 

unfiltered  (total)  wind  components,  respectively.   Finally, 

each  map  is  shaded  gray  in  areas  of  negative  correlation 

values  . 

In  anticipation  that  anomalous  storminess  values  are 

related  to  anomalous  surface  cooling,  it  was  expected  that, 
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Figure  9 


Correlation  maps  of  the  uA   anomaly  data 
fields  with  SST  at  a  lag  of  minus  three 
months.   The  last  closed  isoline  is  labeled 
in  some  regions  having  the  largest  correlations 
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Figure  10.   Same  as  Fig.  9  except  lagged  minus  two  months 


36 


USTAR°F1 


USTAITF2 

60  i r^=^ 


USTAITFm 


140E         160  180  160  140         120W 


Figure  11.   Same  as  Fig.  9  except  lagged  minus  one  month 
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Figure  12.   Same  as  Fig.  9  except  no  lag  between  data  sets 
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Figure  13.   Same  as  Fig.  9  except  lagged  plus  one  month. 
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Figure  14.   Same  as  Fig.  9  except  lagged  plus  two  months 
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Figure  15.   Same  as  Fig.  9  except  lagged  plus  three  months 
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for  lags  greater  than  or  equal  to  zero,  maps  would  be  mostly 

covered  by  negative  correlation  values.   Examination  of 

3 
this  uA   series  of  maps  reveals  that  a  significant  change 

occurs  in  the  zero  and  plus -one -month  lag  maps.   There 

is  a  visible  increase  in  the  gray  shading,  i.e.,  negative 

correlations,  at  these  lags.   Table  II  delineates  some  map 

parameters  that  can  be  used  to  quantify  this  increase  in 

3 
the  negative  correlation  area  when  SST  lags  uA  .   Almost 

without  fail  there  are  significant  changes  in  going  from 
the  minus -one -month  lag  to  the  plus  one.   For  instance, 
the  sum  of  the  negative  correlations  (RNEG)  for  uA  F2 
goes  from  -10.81  at  minus -one -month  lag  to  -30.19  at  plus- 
one-month  lag.   The  number  of  grid  points  with  negative 

correlations  (NNEG)  jumps  from  99  at  minus  one  to  143  at 

3 

zero  and  then  up  to  147  at  plus -one -month  lag  for  uA  F2. 

The  significance  of  this  jump  can  be  appreciated  by  con- 
sidering a  random  correlation  situation.   In  such  a  case, 
83  grid  points  would  be  expected  to  have  positive  correla- 
tions and  83  would  be  expected  to  have  negative  correlations, 
with  a  standard  deviation  of  6.5  grid  points.   This  is 
strictly  true  only  if  the  grid  points  are  independent  of 
each  other.   This  means  that  68%  of  the  time  the  number  of 
negative  points  would  lie  between  76.5  and  89.5  and  95%  of 
the  time  the  number  of  negative  points  would  lie  between  70 
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and  96.   Thus,  the  increase  of  negative  grid  points  from  99 
to  147  is  of  notable  significance. 

Before  continuing  beyond  this  point,  the  issue  of 
significance  must  be  examined  further.   An  issue  of  signifi- 
cance \\ras  raised  and  resolved  earlier  in  this  study  when 
considering  where  to  truncate  the  EOF  series  to  ensure 
maximum  signal  and  minimum  noise.   Thus,  the  correlations 

presented  above  are  those  associated  with  the  geophysical 

3 
fluctuations  in  SST  and  u*   (i.e.,  the  "signal")  and  are  not 

attributable  to  error  "noise."   The  paragraph  preceding  this 
one  noted  the  significance  of  the  changes  that  occurred  in 
the  correlation  between  negative  and  positive  lags  and  pro- 
vided mathematical  evidence  for  this  significance.   The 
final  significance  to  be  examined  is  related  to  the  existence 
of  artificial  correlation  due  to  the  interplay  between  the 
length  of  record  examined  and  the  dominant  time  scales  of 
variation  of  the  processes  being  studied. 

As  a  first  approximation  to  the  magnitude  of  this 
possible  artificial  correlation,  six  grid  points  were  chosen 
(circles  in  Fig.  1)  and  their  temporal  history  of  SST,  com- 
puted after  EOF  truncation,  was  examined  for  repetitive 
events.   (See  Figures  16  through  21.)   SST  data  were  chosen 
to  estimate  the  magnitude  of  the  possible  spurious  correla- 
tions because  it  has  relatively  longer  time  scales  of  varia- 
bility compared  to  the  atmospheric  fields.   Table  III  shows 
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the  values  obtained  from  these  calculations.   The  number  of 
independent  events,  n,  was  estimated  by  averaging  the  number 
of  peaks  with  the  number  of  troughs  in  the  time  series. 
Dividing  120  months  by  n  provided  the  average  event  length. 
The  formula  for  Rcrit,  the  critical  correlation  value,  was 
derived  from  Davis  (1976)  in  the  following  form, 


Rcrit  =  — —  (4-1) 

/n 


where  n  is  again  the  number  of  independent  events  during 
the  data  record  length.   At  any  grid  point,  a  correlation  is 
considered  significantly  different  from  zero  only  if  it 
exceeds  Rcrit.   The  importance  of  these  values  of  Rcrit 
will  be  discussed  in  a  detailed  examination  of  all  the  corre 
lation  maps  in  a  following  paragraph. 

Sciremammano  (1979)  has  described  a  method  of  presenting 
correlations  which  conveniently  and  accurately  incorporate 
the  significance  of  the  correlation  relative  to  the  level 
of  artificial  correlation.   The  technique  consists  of 
normalizing  (dividing)  the  raw  correlations  by  their  large- 
lag  standard  error  (a  ) .   This  is  defined  from  the  Bartlett 


formula  [Bartlett,  1978,  p.  352]  as 


2 

a  Z  =  (  Z    C   (iAt)  C   (iAt)/N  (4-2) 

^    i=_oo  xa     yy 
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where  C^^   and  C    are  the  discrete  autocorrelation  functions 
of  the  variables  x  and  y,  respectively.   It  can  be  seen  that 
(4-2)  represents  the  ratio  of  a  measure  of  the  integral 
time  scale  of  the  variables  x  and  y  to  the  record  length  N. 
Thus,  the  normalization  will  compensate  for  the  interplay 
of  the  dominant  time  scales  of  the  input  processes  and  the 
record  length  and  will  yield  significance  levels.   Considera- 
tion of  this  interplay  can  be  crucial  when  comparing  meteor- 
ological and  oceanic  variables  which,  in  general,  have 
different  time  scales  of  variability.   In  the  present  study, 
such  a  presentation  of  the  correlations  would  essentially 
give  a  more  quantitative  estimate  of  Rcrit  than  given  above, 
and  it  would  allow  Rcrit  to  vary  over  the  North  Pacific, 
reflecting  the  fact  that  the  dominant  time  scales  of  varia- 
bility of  SST  and  u^  may,  perhaps,  not  be  uniform.   These 
calculations  were  not  made  for  this  thesis  and  would  offer 
an  opportunity  in  a  subsequent  study  to  increase  the  value 
of  the  correlation  maps.   For  the  purpose  of  this  thesis, 
a  correlation,  r,  is  considered  significantly  different 
from  zero  if  |rj  >  0.20. 

In  light  of  the  previous  comments,  further  review  of  the 
uAJ  lag  correlation  maps  and  Table  II  will  now  reveal  several 
additional  interesting  details.   All  maps  reflect  a  large 
geographic  scale  of  correlation  compared  to  the  total  area 
of  interest.   The  timing  of  the  maximum  magnitude  of  negative 
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area  (NNEG)  and  maximum  total  negative  correlation  fRNEG) 
varies  among  the  three  data  sets.   Table  II  shows  that 
u*  Fl  has  its  maximum  area  of  negative  correlations  at  the 
zero  month  lag,  while  u*JF2  and  u*  Fm  have  their  max  areas 
established  at  a  plus -one -month  lag.   The  maximum  map  sum 
of  all  negative  correlations  occurs  at  a  plus -one -month  lag 
for  u*  Fl  and  u*  F2  and  at  a  zero  month  lag  for  uA^Fm.   The 
values  of  RNEG  and  NNEG  attained  by  the  u*  F2  data  at  a  lag 
of  plus  one  month  exceeded  all  other  data  maps.   Thus,  it 
appears  the  u*   verifies  the  expectation  that  anomalous 
storminess  values,  as  reflected  by  u*JF2,  are  related  to 
anomalous  surface  cooling  with  a  time  scale  of  two  to  six 
weeks , 

It  is  interesting  to  note  that  if  only  correlations  of 
absolute  value  greater  than  0.20  are  considered  significant, 
then  most  of  the  trends  indicated  above  remain  in  effect. 
For  uA  Fl  and  u*  F2,  the  bulk  of  the  positive  correlation 
areas  are  not  significant,  while  approximately  half  of  the 
negative  correlations  (at  lag  of  +1  month  in  Figure  13)  are. 
The  result  appears  to  be,  after  considering  all  the  possible 
significance  checks  provided  previously  and  the  map  details 
delineated  above,  that  at  a  plus -one -month  lag  there  is  a 
large  mid-oceanic  area  where  u*  Fl  and  us  F2  acting  as 
anomalous  storminess  variables  produce  a  cooling  of  the  sea 
surface.   The  mechanism  responsible  for  this  cooling  most 
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likely  is  the  vertical 'mixing  (ent rainment)  of  cold  water 
from  below  the  mixed-layer. 

B.   LARGE  LAG  CORRELATIONS  BETWEEN  u*3Fl  AND  SST 

As  an  experiment  to  lend  additional  credibility  to  the 
large  shift  in  correlations  observed  at  the  plus -one -month 

lag,  an  analysis  was  conducted  by  correlating  the  SST  field 

3 
with  a  wind  parameter,  specifically  u*  Fl,  that  occurred 

12,  17,  21,  24,  29,  and  36  months  later.   It  was  expected 

that  the  correlations  obtained  from  this  analysis  would  be 

non-significant  and  offer  by  their  scale  and  magnitude  some 

support  for  the  previously  described  results.   Figure  22 

contains  the  correlation  maps,  from  top  to  bottom,  of  minus 

12,  minus  24,  and  minus  36  months'  lag  and  Figure  23  contains 

the  correlation  maps,  from  top  to  bottom,  of  minus  17, 

minus  21,  and  minus  29  months'  lag.   Examination  of  these 

3 
two  figures  reveals  that,  in  comparison  to  u^  Fl  lagged  plus 

one  month,  all  maps  have  much  smaller  areas  of  negative 

correlations,  although  some  areas  are  significant  (greater 

than  .20)  by  the  earlier  test. 

Table  IV  which  delineates  the  values  of  RNEG,  NNEG, 

3 
RPOS,  and  RSUM  for  u*  Fl  at  the  trial  lags  examined  indi- 
cates that  the  RNEG  and  NNEG  values  remain  much  below  those 

3 
values  observed  for  u*  Fl  at  zero  and  plus -one -month  lag. 

The  average  RNEG  is  -7.67  and  the  average  NNEG  is  76  for 

lags  of  minus  one  to  minus  36  months.   Both  values  lie 
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Figure    22. 


Correlation  maps  of  uA  Fl  and  SST  Anomaly 

fields  for  lags  of  minus  12,  24  and  36 

months.   The  last  closed  isoline  is  labeled 

in  some  regions  having  the  largest  correlations 
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Minus  17  months'  Lag 


Minus  21  months '  Lag 


Minus  29  months '  Lag 


140E         160  180  160  140         120W 


Figure  23.   Same  as  Fig.  22  except  for  lags  of  minus 
17,  21  and  29  months. 
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7 

considerably  below  the  values  observed  for  uA  Fl  at  a  plus- 
one  -month  lag,  where  RNEG  was  -24.96  and  NNEG  was  131. 


C.   LAG  CORRELATIONS  BETWEEN  curl  t  AND  SST 

Figures  24  to  50  are  maps  of  lagged  correlations 

between  curl7i  and  SST  over  the  same  area  as  the  previously 

■z 
viewed  correlation  maps  of  u^   and  SST.   The  layout  of  the 

maps  is  exactly  the  same  as  before  with  the  maps  progressing 
through  lags  of  minus  three  to  plus  three  months,  in  one- 
month  increments  with  Figure  24  showing  the  minus  -  three -month 
case  and  Figure  30  ending  the  series  with  a  plus -three -month 
lag  between  curl  t  and  SST.   In  this  map  series,  as  in  the 
previous  series,  SST  is  lagged  relative  to  curler.   Thus, 
the  map  with  a  lag  of  plus  one  month  shows  the  correlation 

between  the  curl  x  field  and  the  SST  field  that  occurred  one 

z 

month  later.   Each  figure  contains  three  maps,  which  are, 
from  top  to  bottom,  correlations  between  SST  and  (1)  curl^xFl 
computed  from  high-pass  wind  components  only,  (2)  curl  tF2 
computed  from  a  combination  of  high-  and  low-pass  components, 
and  (3)  curl  iFm  computed  from  unfiltered  (total)  wind  com- 
ponents, respectively.   As  before,  each  map  is  shaded  gray 
in  areas  of  negative  correlation  values. 

Similar  expectations  existed  for  this  set  of  correlation 
maps  like  those  indicated  before  for  uA  "   That  is,  since 
anomalous  values  of  wind  stress  curl  indicate  anomalous 
Ekman  pumping,  it  was  expected  that  for  lags  greater  than  or 
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Figure  24.   Correlation  maps  of  the  curl  anomaly  data 
fields  with  SST  at  a  lag  of  minus  three 
months.  The  last  closed  isoline  is  labeled 
in  some  regions  having  the  largest  correlation: 


60 


CURL    Fl 


CURL    F2 


CURL    Fm 


140E         160  180  160  140         120W 


Figure  25.   Same  as  Fig.  24  except  lagged  minus 
two  months . 
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Figure  26.   Same  as  Fig.  24  except  lagged  minus 
one  month. 
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Figure  27.   Same  as  Fig.  24  except  no  lag  between 
data  sets . 
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Figure  28.   Same  as  Fig.  24  except  lagged  plus  one  month 
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Figure  29.   Same  as  Fig.  24  except  lagged  plus  two 
months . 
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Same  as  Fig.  24  except  lagged  plus  three 
months . 


6  6 


equal  to  zero,  the  maps  would  be  mostly  covered  by  negative 
correlation  values.   Examination  of  Figures  24  through  30 

reveals  that  the  correlations  between  curl  t  and  SST  occur  on 

3 
a  smaller  geographic  scale  than  the  correlations  between  uA 

and  SST.   There  is  an  increase  in  the  amount  of  gray  shaded 
area  as  the  maps  are  viewed  from  minus  three  to  plus  three 
months  lag  with  a  maximum  occurring  at  the  zero  or  plus-one- 
month  lag.   This  increasing  gray  shading  trend  is  not  as 
strong  as  that  obsrved  in  the  uA   series  and  examination  of 
Table  V  shows  that  the  statistics  used  to  quantify  the 
correlations  do  not  approach  those  values  obtained  from 
Table  II,  the  uA   measurements.   Table  V  reveals  that  the 
max  number  of  negative  correlations  (NNEG)  is  attained  by 

the  curl  xFm  field  with  a  plus -one -month  lag.   The  maximum 

z  r  ° 

magnitude  of  totaling  all  negative  correlations  (RNEG)  is 
observed  to  occur  at  a  plus -one -month  lag  in  the  curl  tF2 
field.   Remembering  that  the  random  correlation  situation 
would  have  a  standard  deviation  of  6.5  grid  points,  both 
curl  tF2  and  curl  iFm  undergo  shifts  of  approximately  two 
standard  deviations  or  more  when  the  lag  shifts  from  minus 
one  to  zero  and  from  zero  to  plus  one  month  lag.   Thus 
there  is  a  significant  shift  away  from  the  possible  random 
correlation  case.   When  only  the  statistically  significant 
correlations  (|r|  >  .20)  are  considered,  much  smaller 
negative  correlation  areas  result  in  Figure  28.   These 
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"significant"  areas  are  small  and  scattered.   In  the  curl  xF2 

z 

field,  a  negative  correlation  area  persists  off  the  West 
Coast  of  Canada  with  a  maximum  size  and  strength  at  plus  one 
lag.   In  general,  the  significance  of  the  relationship 
between  SST  and  curler  appears  to  be  somewhat  less  than  that 
between  SST  and  u*  . 
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V.   CONCLUSIONS 

The  primary  hypothesis  that  was  tested  in  this  study 
was  that  the  sea  surface  temperature  (SST)  responds  to 
atmospheric  wind  forcing,  as  manifested  in  storms,  and  that 
the  amount  of  response  is  proportional  to  the  amount  of  wind 
forcing.   When  the  atmospheric  forcing  is  characterized  as 
an  anomalous  storminess  and  calculated  as  u*   and  curl  t, 
it  has  a  negative  correlation  with  anomalous  SST  over  most 
of  the  North  Pacific  at  a  plus -one -month  lag.   It  appears 
that  u*   is  more  strongly  related  to  SST  anomalies  than  is 
curl  t.   Thus,  vertical  mixing  may  be  a  stronger  influence 
than  Ekman  divergence  and  convergence  in  surface  cooling. 
However,  this  difference  could  be  caused  by  difficulties 
inherent  in  computing  curl  [Saunders,  1976]. 

Another  conclusion  concerns  the  compensation  required 
to  validate  the  magnitudes  of  the  map  correlations.   The 
relatively  short  length  of  record  (120  months)  necessitates 
a  calculation  of  the  artificial  correlation  and  a  signifi- 
cant reduction  in  the  computed  correlations.   Any  increase 
in  the  record  length  will  reduce  the  level  of  artificial 
correlation  for  this  type  of  study  [see  (4-1)],  but  to  make 
a  significant  increase  in  the  total  validity,  a  doubling  or 
quadrupling  of  the  total  record  length  is  necessary. 
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The  present  study  examined  the  relationships  between 
the  atmosphere  and  ocean  that  exists  throughout  the  year. 
It  is  now  well  known  [Davis,  1978]  that  many  ocean- 
atmospheric  relationships  are  strongly  seasonally  dependent. 
It  is  possible  that  if  the  data  of  this  study  were  stratified 
into  seasons,  statistically  significant  results  could  be 
obtained.   The  problem  of  artificial  correlation  would 
increase  because  of  fewer  independent  events  [n  in  Table  III 
and  (4-1)]  available  for  correlation,  but  the  actual  corre- 
lations could  possibly  increase  enormously.   For  example,  the 
relation  of  the  winter  anomalous  storminess  with  anomalous 
sea  surface  temperature  cooling  has  the  potential  to  be  much 
more  significantly  correlated  than  the  non-seasonal  relations 
developed  in  this  research. 

Whenever  one  endeavors  in  a  study  of  this  sort,  more 

questions  are  often  generated  than  answers  found.   One 

remaining  question  is,  What  can  the  wind  alone  tell  about 

the  future  state  of  the  ocean?   That  is,  perhaps  u*   and 

curler  are  sufficiently  uncorrelated  that  they  together 

may  be  more  significantly  correlated  with  SST.   It  is 

recommended  therefore  that  a  further  study  correlating  SST 

with  a  combined  form  of  u,   and  curl  t  be  done  in  order  to 

*  z 

provide  a  more  complete  answer  to  this  question.   This  will 
give  a  more  complete  estimate  of  the  total  importance  of 
the  winds,  relative  to  other  effects,  in  changing  the  SST. 
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APPENDIX 

Methods  for  representing  arrays  of  environmental 

variables  by  empirical  orthogonal  functions  have  been 

formulated  in  a  number  of  different  ways.   [See,  for  example, 

Lorenz,  1956;  Kutzbach,  1967;  Stidd,  1967;  Davis,  1976.] 

Most  of  the  information  presented  in  this  appendix  is  based 

upon  the  work  of  Kutzbach  (1967). 

Let  A   be  a  P  component  vector  which  represents  the 
n  r  r 

+■  Vi 

n    (n  =  1 .  .  .  .  ,  N)  observation  of  P  variables  and  let  A 

be  a  P  by  N  matrix,  where  the  n    column  is  the  observation 

vector  A  .   The  goal  is  to  determine  a  vector  e  which  has  the 
n       ^ 

highest  degree  of  similarity  to  all  the  A   observation 

°       °  J  n 

vectors  simultaneously.   Kutzbach  (1967)  explains  that  if 

the  similarity  is- measured  with  the  squared  and  normalized 

inner  product  between  the  observation  vectors  A   and  the 

n 

determined  vector  e,  the  problem  is  to  maximize 


fe'AlVVe'e    =    Ce'A)(e'A)'    =    e'AA'e 
ie   -  }    *     /e    e  Ne 'e  Ne'e 


after  averaging  across  all  A  .   The  prime  denotes  the  trans 

o      a  n 

pose.   This  is  equivalent  to  maximizing 


e'  Be  ,  (1-1) 
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subject  to  the  condition 


e1  e  =  1  ,  (1-2) 


where  B  is  an  m  by  m  symmetric  matrix  whose  b.     element 

ij 

is  given  by 


-1   N 
b. .  =  N     Z      A.   A.   , 
ij         t   in  in  ' 

J        n=l       J 


=  N"1  (A'A)  ,  (1-3) 


and  m  is  the  larger  of  P  and  N. 

Furthermore,  Kutzbach  (1967)  continues  by  pointing  out 
that  the  maximization  of  (1-1)  subject  to  (1-2)  leads  to 
the  equation 

Be  =  eX    .  (1-4) 

The  vector  e  and  the  parameter  X    can  now  be  recognized  as 
an  eigenvector  and  associated  eigenvalue  of  the  symmetric 

normalized  matrix  B.   It  can  be  shown  that  the  e.  are 

l 

orthogonal  and  the  X-  are  real  and  positive.   Therefore, 
(1-4)  can  be  rewritten  as 

BE  =  EL  (1-5) 
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where  E  is  the  m  by  m  matrix  of  eigenvectors  and  L  is  the 
m  by  m  diagonal  matrix  of  eigenvalues.   Note  that 


E'E  =  I  .  (1-6) 

In  the  following,  it  is  assumed  that  the  elements  of  L  and  the 

columns  of  E  have  been  arranged  such  that  e,  is  associated 

with  A.,,  the  largest  eigenvalue  of  B,  e  9  ,  is  associated  with 

\~,    the  next  largest  eigenvalue  of  B,  and  so  forth.   Combining 
(1-3)  and  (1-5),  and  noting  from  (1-6)  that  for  orthogonal 

matrices  the  transpose  is  equal  to  the  inverse,  one  obtains 

E'AA'E  =  LN  (1-7) 


defining 


E  =  E'A  (1-8) 


where  C  is  an  m  by  m  matrix,  it  follows  that 


A  =  EC  (1-9) 


and 


m 


A  =  I      c  e.  ,  n=l ,  N.  (1-10) 

n   .  ,   in  i*        '  v    J 
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Examination  of  (1-10)  reveals  that  the  observation  vector 
A   can  be  expressed  as  a  linear  combination  of  the  m  eigen- 
vectors.   If  all  m  eigenvectors  are  used,  then  the  observa- 
tion vector  A   is  exactlv  reproduced. 

n  '         r 

In  most  applications  of  empirical  orthogonal  function 
to  non-random  data,  it  is  found  that  a  large  portion  of  the 
variance  can  be  accounted  for  by  retaining  only  the  first 
few  largest  eigenvalues  and  related  eigenvectors.   Thus, 
retaining  only  the  first  few  terms  of  (1-10)  approximates 

■k 

the  observation  vector.   If  A    is  defined  as  the  aporoxi- 

n  r" 

+■  hi 

mation  to  the  n    observation  vector,  then 


* 

p 

A      E   c.  e.,  n=l, N;   P*  <  P  .      (1-11) 

n    •  ,   in  i'     '  v    J 


This  allows  truncation  of  noise  by  removing  those  terms  with 

small  variances.   These  smaller  terms  are  most  likely  to 

consist  almost  entirely  of  noise  as  shown  by  the  work  of 

Preisendorf er  and  Barnett  (1977) . 

Specific  application  of  the  preceding  is  delineated 

below,  using  SST  data,  in  an  outline  detailing  the  array 

sizes  and  IBM-360  IMSL  routines.   Note:   Only  the  trunca- 
te 
tion  limits  (the  value  of  P  )  varied  among  the  data  sets  and 

P  =  166  grid  points  with  N  =  120  observations. 
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ARRAY 


NAME 


COMPUTER 
ROUTINE 


A  (120  x  166) 


Anomalous 
Data  Set 


A'A  (166  x  120)  (120  x  166) 


Transpose 
Product 


VTPROF 


B  (166  x  166)/120 


Normalized 

Symmetric 

Matrix 


L  (166  x  166) 
E  (166  x  166) 
E  (166  x  12) 


Eigenvalues 


Eigenfactors 


Truncated 
Eigenvectors 


EIGRS 


EIGRS 


C  (120  x  12) 


Time 
Coefficients 


VMULFF 


A   (120  x  166) 


Approximated 
Data  Set 


VMULFP 
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